A Genealogical Interpretation of Principal Components Analysis
نویسنده
چکیده
Principal components analysis, PCA, is a statistical method commonly used in population genetics to identify structure in the distribution of genetic variation across geographical location and ethnic background. However, while the method is often used to inform about historical demographic processes, little is known about the relationship between fundamental demographic parameters and the projection of samples onto the primary axes. Here I show that for SNP data the projection of samples onto the principal components can be obtained directly from considering the average coalescent times between pairs of haploid genomes. The result provides a framework for interpreting PCA projections in terms of underlying processes, including migration, geographical isolation, and admixture. I also demonstrate a link between PCA and Wright's f(st) and show that SNP ascertainment has a largely simple and predictable effect on the projection of samples. Using examples from human genetics, I discuss the application of these results to empirical data and the implications for inference.
منابع مشابه
Faults and fractures detection in 2D seismic data based on principal component analysis
Various approached have been introduced to extract as much as information form seismic image for any specific reservoir or geological study. Modeling of faults and fractures are among the most attracted objects for interpretation in geological study on seismic images that several strategies have been presented for this specific purpose. In this study, we have presented a modified approach of ap...
متن کاملA new weighting approach to Non-Parametric composite indices compared with principal components analysis
Introduction of Human Development Index (HDI) by UNDP in early 1990 followed a surge in use of non-parametric and parametric indices for measurement and comparison of countries performance in development, globalization, competition, well-being and etc. The HDI is a composite index of three indicators. Its components are to reflect three major dimensions of human development: longevity, knowledg...
متن کاملPersian Handwriting Analysis Using Functional Principal Components
Principal components analysis is a well-known statistical method in dealing with large dependent data sets. It is also used in functional data for both purposes of data reduction as well as variation representation. On the other hand "handwriting" is one of the objects, studied in various statistical fields like pattern recognition and shape analysis. Considering time as the argument,...
متن کاملRobust Principal Component Analysis and Fractal Methods to Delineate Mineralization-Related Hydrothermally-Altered Zones from ASTER Data: A Case Study of Dehaj Terrain, Central Iran
The Dehaj area, located in the southern part of the Urumieh-Dokhtar magmatic belt, is a well-endowed terrain hosting a number of world-class porphyry copper deposits. These deposits are all hosted in an acidic to intermediate volcano-plutonic sequence greatly affected by various types of the hydrothermal alterations, whether argillic, phyllic or propylitic. Although there are a handful of hithe...
متن کاملEvaluation of psychometric properties of Persian Version of Social Comparisons during chronic illness scale
Background: When a person is under stressful circumstances, such as confrontation with a life-threatening disease, often make comparisons with each other, which can have effects on their psychological and physical well-being. This research aimed to evaluate the validity and reliability of Persian version of social comparisons during chronic illness scale. Methods: The study was descriptive wit...
متن کامل